Speaker Identification Using Admissible Wavelet Packet Based Decomposition
نویسندگان
چکیده
Mel Frequency Cepstral Coefficient (MFCC) features are widely used as acoustic features for speech recognition as well as speaker recognition. In MFCC feature representation, the Mel frequency scale is used to get a high resolution in low frequency region, and a low resolution in high frequency region. This kind of processing is good for obtaining stable phonetic information, but not suitable for speaker features that are located in high frequency regions. The speaker individual information, which is non-uniformly distributed in the high frequencies, is equally important for speaker recognition. Based on this fact we proposed an admissible wavelet packet based filter structure for speaker identification. Multiresolution capabilities of wavelet packet transform are used to derive the new features. The proposed scheme differs from previous wavelet based works, mainly in designing the filter structure. Unlike others, the proposed filter structure does not follow Mel scale. The closed-set speaker identification experiments performed on the TIMIT database shows improved identification performance compared to other commonly used Mel scale based filter structures using wavelets. Keywords—Speaker identification, Wavelet transform, Feature extraction, MFCC, GMM.
منابع مشابه
New Filter Structure based on Admissible Wavelet Packet Transform for Text-Independent Speaker Identification
Identical acoustic features like Mel frequency cepstral Coefficients (MFCC)and Linear predictive cepstral coefficients (LPCC) are being widely used for different tasks like speech recognition and speaker recognition, whereas the requirement of speaker recognition is different than that of speech recognition. In MFCC feature representation, the Mel frequency scale is used to get a high resolutio...
متن کاملWavelet Packet Transform Features with Application to Speaker Identification
This study proposes a new set of feature parameters based on wavelet packet transform analysis of the speech signal. The new speech features are named subband based cepstral parameters (SBC) and wavelet packet parameters (WPP). The ability of each parameter set to capture speaker identity conveyed in the speech signal is compared to the widely used Mel-frequency cepstral coee-cents (MFCC). The ...
متن کاملRobust Digital Speech Watermarking For Online Speaker Recognition
A robust and blind digital speech watermarking technique has been proposed for online speaker recognition systems based on Discrete Wavelet Packet Transform (DWPT) and multiplication to embed the watermark in the amplitudes of the wavelet’s subbands. In order to minimize the degradation effect of the watermark, these subbands are selected where less speaker-specific information was available (5...
متن کاملA Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification
One of the most widely used approaches for feature extraction in speaker recognition is the filter bank-based Mel Frequency Cepstral Coefficients (MFCC) approach. The main goal of feature extraction in this context is to extract features from raw speech that captures the unique characteristics of a particular individual. During the feature extraction process, the discrete Fourier transform (DFT...
متن کاملOverlapping wavelet packet features for speaker verification
A generalization of the Discrete Wavelet Packet Transform (DWPT), referred to as Overlapping Discrete Wavelet Packet Transform (ODWPT), is proposed. In contrast to the traditional DWPT, the ODWPT assumes overlapping among the frequency sub-bands at various levels of the transform. Based on this overlapping strategy, a new set of speech features that is specially designed for speaker recognition...
متن کامل